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Abstract 

We prove an inequality for polynomials applied in a symmetric way 
to non-commuting operators. 



1 Introduction 

J. von Neumann [§] proved an inequality about the norm of a polynomial 
applied to a contraction on a Hilbert space H. Let D be the unit disk and T 
the unit circle in C, and for any polynomial p let \\p\\x be the supremum of 
the modulus of p on the set X. The result is that 

T e B(H),\\T\\ < 1 ^ \\p(T)\\ < \\p\\ w (1.1) 

For polynomials p(z) = p(zi, z 2 , . . . , z n ) = Ys\ a \<N c <* za * n n variables 
we use the standard multi-index notation (where a = (ai, a 2 , ■ ■ ■ , has 
< ctj G Z for 1 < j < n, \a\ = Y^j=i a ji z ° = Yll=i Z J J )- There is an 
obvious way of applying p to an n-tuple T = (T 1; T 2 , . . . ,T n ) of commuting 
operators 2} G B(H) (1 < j < n), namely 

p(T)=p(T 1 ,T 2 ,...,T n )= c ^ a 

\a\<N 
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(with T a = ]\ n j=l Tp and Tf = I). 

T. Ando [2] proved an extension of von Neumann's inequality to pairs of 
commuting contractions. 

Theorem 1.1 (Ando). IfT 1: T 2 e B{H), maxd^, ||T 2 ||) < 1, T X T 2 = T 2 T X 
and p(z) = p(zi, z 2 ) is a polynomial, then 

\\p(T u T 2 )\\<\\ P \y. 

The purpose of this note is to look for analogues of Ando's inequality that 
are satisfied by non- commuting operators. For a polynomial p in n variables 
and an n-tuple of operators T = (T±, . . . ,T n ) we define p sym (T) to be a 
symmetrized version of p applied to T (we make this precise in Section [2]) . 
We are looking for results of the form: 

For all n-tuples T of operators in a certain set, there is a set K\ in C n 
such that 

|bsym(T)|| < |b|k- (1.2) 

and 

For all n-tuples T of operators in a certain set, there is a set K 2 in C n 
and a constant M such that 

|bsym(T)|| < M\\p\\ K2 . (1.3) 



Our main result is: 

Theorem 14.61 There are positive constants M n and R n such that, when- 
ever T = (Ti, T 2 , . . . , T n ) G B(H) n satisfies 



i=l 



and p is a polynomial in n variables, then 

UftymCOII < IIpIIjut (1-4) 

lbs ym (T)|| < M n \\p\\ w . (1.5) 

Moreover, one can choose R 2 = 1.85, = 2.6, M 2 = 4.1 and M 3 = 16.6. 
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2 Tuples of noncommuting contractions 

There are several natural ways one might apply a polynomial p(zi, z 2 ) in two 
variables to pairs T = (Ti,T 2 ) G B(H) 2 of operators. A simple case is for 
polynomials of the form p(z±, z 2 ) = Pi(zi) + p 2 (z 2 ) where we could naturally 
consider p(T\, T 2 ) to mean pi(Ti) + p 2 {T 2 ). 

A recent result of Drury [5] is that if p(z±, z 2 ) = Pi(zi) + p 2 (z 2 ), T 1; T 2 e 
B(H) (no longer necessarily commuting), max(||Ti||, HT2II) < 1, then 

\W*T*)\\ <V2|b|| s2 . (2.1) 

Moreover, Drury [5] shows that the constant \[2 is best possible. 
One way to apply a polynomial p(zx,z 2 ) = YTjk= 

o to two non- 
commuting operators T\ and T2 is by mapping each monomial z{z 2 to the 
average over all possible products of j number of T\ and k number of T 2 , and 
then extend this map by linearity to all polynomials. We use the notation 
Psym {Ti,T 2 ) and the formula 

n j+k 
j,k=0 \ j ) S€V(j+k,j) i=l 

where V(j + k,j) denotes the subsets of {1,2, ... ,j + k} of cardinality j. 
The empty product, which arises for j — k — 0, should be taken as the 
identity operator. The notation Y[{=i T 2 - Xs (i) is intended to mean the ordered 
product 

T 2 - X sW T 2-xs(2) ■ ■ ■ T 2 - xs ( j+k ), 
and Xsi^) denotes the indicator function of S. 

Remarks 2.1. The operation p h-> p sy m{Ti,T 2 ) is not an algebra homomor- 
phism (from polynomials to operators). It is a linear operation and does not 
respect squares in general. 

For example, if p(zx, z 2 ) — z\ + z 2 , then 

p sym (Ti,T 2 ) = T-f + T| 

but for q(zi, z 2 ) = (p(zi, z 2 )) 2 = zf + z 2 + 2zfz 2 we have 

(p syia (Ti, T 2 )f = T\ 4 + T 4 + T 2 x Tl + T 2 2 Tl ± qeym {T u T 2 ) 

in general. 

Similarly for p(z±, z 2 ) = 2z±z 2 and 

q(zi,z 2 ) = (p(z u z 2 )f = Azfz 2 , 
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p sym (T 1 ,T 2 ) = T X T 2 + T 2 T U 

(p sym {T u T 2 )f = T X T 2 T X T 2 + T x TlT x + T 2 T?T 2 + T 2 T X T 2 T X ± g sym (T 1; T 2 ) 
in general. 

However in the very restricted situation that p(zi, z 2 ) = a + fiz\ + jz 2 
and q =p m , then we do have q sym {Ti, T 2 ) = (p sym (Ti, T 2 )) m . 

The symmetrizing idea generalizes in the obvious way to n > 2 variables. 
We will use the notation p syra (T) for n-tuples T G B(H) n for n > 2. 

3 Example 

The analogue of Ando's inequality for n > 3 commuting Hilbert space con- 
tractions and polynomials norms on D n is known to fail (see Varopoulos [10] , 
Crabb & Davie [4], Lotto & Steger [7], Holbrook 0). 

The explicit counterexamples of Kaijser & Varopoulos [ID] , and Crabb 
& Davie [1]) have p(T) nilpotent (and so of spectral radius 0). While the 
examples of Lotto & Steger [7] and Holbrook [6]) do not have this property, 
they are obtained by perturbing examples where p(T) is nilpotent (and so 
p{T) has relatively small spectral radius). 

It is not known whether there is a constant C n so that the multi- variable 
inequality 

lb(T)|| = |b(r 1 ,r 2 ,... J r n )||<c B ||p|| ir (3.1) 

holds for all polynomials p(z) in n variables and for all n-tuples T of com- 
muting Hilbert space contractions. However, it is well-known that a spectral 
radius version of Ando's inequality is true — indeed, it holds in any Banach 
algebra. 

Proposition 3.1. Ifp is a polynomial in n variables and T = (T 1; T 2 , . . . , T n ) 
is an n-tuple of commuting elements in a Banach algebra, each with norm at 
most one, then 

p(p(T))= lim \MT)r\\ 1/m < \\P\\W (3-2) 

m— >oo 

Proof. We consider a fixed n. It follows from the Cauchy integral formula, 
that if maxKjXn ||Ty|| < r < 1, then 

||p(T)|| = ||p(r 1 ,r 2 ,...,r n )||<aiNlir ( 3 - 3 ) 

for a constant C r depending on r (and n). 
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To see this write 

.. „ n n 

P(T) = L. 11*0 n<o - T >r l <*& ^ - ^ 

and estimate with the triangle inequality. This shows that C r = (1 — r)~ n 
will work. 

Applying (13. 3p to powers of p and using the spectral radius formula, we 

get 

p(p(T)) < |b||i«, 

(provided maxi<j<„ ||Tj|| < r < 1). However, for the general case maxi<j<„ 
1, we can apply this to rT to get 

p(p(T)) = lim p{p{rT)) < \\p\\oo. □ 

Example 3.2. Letp(z, w) = (z— w) 2 +2(z+w) + l = z 2 +w 2 — 2zw+2(z+w)+l, 

= /cos(vr/3) sin(vr/3) \ = / 1/2 y/3/2\ 
1 ^sin(7r/3) -cos(tt/3)J \^/2 -1/2 J' 

/ cos(vr/3) -sin(7r/3)\ 
2 y-sin(V/3) -cos(7r/3)y ' 

Note that ||p||d 2 > 1) = 5. To show that ||p||i>2 < 5, consider the 

homogeneous polynomial 

q(zi, z 2 , z 3 ) = z\ + zl + z\- 2z x z 2 - 2z x z 3 - 2z 2 z 3 

and observe first that p(z, w) = q(z,w, —1). Moreover 

IMIb* = IblliP = = Hallos, 

by homogeneity of q and the maximum principle. Holbrook [6J Proposition 
2] gives a proof that ||g||D 3 = 5. 
We have 

p sym (7\,T 2 ) = (T 1 -T 2 ) 2 + 2(T 1 +T 2 ) + I 

° ^ + 2(1 °l+J 



v/3 o y v° - 1 

3 0\ f2 \ A 
3i + \0 —2 / + V 1 



6 
2 



So ||p sym (Ti,T 2 )|| = 6 > 5 = ||p|| D 2. 
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Remark 3.3. The example has hermitian T\ and T 2 and a polynomial with 
real coefficients and yet p(p sym (T 1 ,T 2 )) > \\p\\o 2 - Thus even Proposition 13.11 
does not hold for non-commuting pairs. 

One can show that for the polynomial p of Example I3.2[ one has the 
inequality 

IbsymC^T^H < (1 + 4^)^6.66 

for all contractions T\ and T 2 . This estimate is at least an improvement over 
using the sum of the absolute values of the coefficients of p, so one is led to 
ask how well can one bound ||p sym (T)|| for general pi 

4 IIEC^n < 1 

In this section, we shall consider n-tuples T = (T 1( . . . , T n ) of operators, not 
assumed to be commuting, and we shall make the standing assumption: 

n 

llXl^ll < 1 VCi e D. (4.1) 

t=l 

This will hold, for example, if the condition 

n 
i=l 

holds. We wish to derive bounds on ||p sym (T)||. We start with the following 
lemma: 

Lemma 4.1. If S E B(H) and \\S\\ < 1 then 

+ s)(i - sy 1 ) > 0. 

Proof. 

2K((/ + S)(/-S)- 1 ) 
= {I-S*)- 1 {I + S*) + {I + S){I-S)- 1 

= {i- s*)- 1 [{i + s*){i -s) + (i- s*){i + s)} (/ - sy 1 
= 2(1 - s*y l [i - s* s)\(i - sy 1 
> o. 

□ 
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ft! 



lip(z) = ^2c a z a , define 

v P {z) = E c «fn^ fL3) 

\a\\ 

(as usual, a\ means ai! ■ ■ -a n \). We let A denote the inverse of Y: 



Proposition 4.2. Let T = {T U T 2 , . . . ,T n ) e B(H) n satisfy and p{< 

be a polynomial in n variables. Then 



lbsym(T)|| < ||rp|| r . 

Proof. We first restrict to the case 



(4.4) 



E^ 



3=1 



< 1 



C = (Ci,C 2 ,...,Cn)eT' 1 ^||C-r|| 

and hence by Lemma 14.11 the operator 

oo oo 

(I + ( - T)(I - ( - T)- 1 = (I + ( .T)J2(( - T) j = / + 2^(C-T)^ 



j=0 



3=1 



has positive real part 
K{(,T) 



^((I + C-T^I-C-T)- 1 ) 

oo oo 



3=1 



7=1 



E ^VUCO 



.ai,...,a„=0 

We can compute that for polynomials p(z) = p(zi, 2:2, . . . , z n ), 
Psym (T)= [ Tp(C)K(CT)da(C) 
with da indicating normalised Haar measure on the torus T n (and ( 

(Cl? C2, • • • , Cn))- 

As 

K(C,T)da(0 



is a positive operator valued measure on T n , we then have a positive unital 
linear map C(T n ) B(H) given by / h+ J Jn f(/£)K(£, T) da((). As this map 
is then of norm 1, we can conclude 

lbs ym (T)||<||r P || r . 

For the remaining case sup^ gT n \\( ■ T\\ = 1, we have 

|b sym (T)||= lim |b sym (rT)||<||r P || r . □ 

r— >1~ 

Remark 4.3. The technique of the above proof is derived from methods of 

Now we want to estimate ||rp|Liv. 
Proposition 4.4. For each n > 2 there is a constant M n so that 

||i>||d" < M n \\p\\ w . 



Moreover, 



M 2 < 4.07 
M 3 < 16.6 



Proof. Define 



Then 



m = E fV*- ( 4 - 5 ) 

ai=0,...,a„=0 



r P (z) = / p(c)[J(ziCi,...,^C«)]^(0- (4-6) 

To use (14. 6p . we break J into two parts — the sum Jo where the minimum 
of the «j is 0, and the remaining terms J\. 



oi=l,...,a n =l 

Case: n = 2. Here, 

p(CH(«iCi,^C 2 )<M0 = p{z 1 ,0)+p{0,z 2 )-p{0,0). (4.7) 

T 2 

So the norm of the left-hand side of (14. 7p is dominated by 3||p|| H 2. 
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For J 1; we will use the estimate 




P {0Jl(z 1 Cl,Z 2 C 2 )da{C) < IbHooll^llUi < IbHooll^lllxa. 



We have 




/V _ 9 

V 3 4 
(1.069) 2 . 



< 



(In the penultimate line, we let k = a\ + a 2 ; there are k — 3 terms with this 
sum, and the largest they can be is when either a\ or a 2 is 2.) Adding the 
two estimates, we get M 2 < 4.07. 

Case: n — 3. Again, we estimate the contributions of Jo and J\ separately. 
We have 



where we have had to subtract some terms to avoid double-counting. Thus 
the contribution of Jo is at most 3M 2 + 4. 

To calculate the contribution of J\, we make the following estimate on 
II Jill l 2 ; which is valid for all n > 3: 

We want to bound 



Let k = \a\ in (I4.8j) . Note first that the number of terms for each k is the 
number of ways of writing A; as a sum of n distinct positive integers (order 



p(C)^o(^iCi, z 2 ( 2 , z 3 ( 3 )da{() 

= ryo, z 2 , z 3 ) + \r P ( Zl) o, z 3 ) - p(o, o, z 3 )\ 

+ [Tp( Zl , Z2, 0) - p( Zl , 0, 0) - p(0, 22, 0) + p(0, 0, 0)] 




(4.8) 




a|! _ fc(Jfe-l)---(ife-Ti + 2)' 
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Therefore (14. 8p is bounded by 



^ In- 1 



k=n 

oo 



£ 

k=n 



k(k - 1) • • • (fc - n + 2) 

fc - n + 1 1 
(ra-l)!fcfc(fc -!)•••(& -ri + 2)' 



The terms on the right-hand side of (14.91) decay like l/k n 1 , so the series 
converges for all n > 3. When n = 3, the series is 

£^1) £ < a381)2 ' 

Therefore M 3 < 3M 2 + 4.381 < 16.59. 

We now proceed by induction on n. The contribution from J is dom- 
inated by applying T to the restriction of p to the slices with one or more 
coordinates equal to 0, and these are bounded by the inductive hypothesis. 
The contribution from J\ is bounded by (14.81) . 

□ 

We have proved that the polydisk is an M-spectral set for T; we can make 
the constant one by enlarging the domain. 

Proposition 4.5. There is a constant R n so that 

lirplhr < Iblki-- ( 4 -9) 

Moreover, 

R 2 < 1.85 
R 3 < 2.6 

Proof. Let L(rj) = 2dtJ(rj) — 1. Adding terms that are not conjugate analytic 
powers of ( inside the bracket in (14. 6p will not change the value of the integral, 
so, writing z( for the n-tuple (ziCi, . . . , z n (n), we get 

Tp(z) = [ p(0[L(z()]da(0. (4.10) 

As L is real and has integral 1, if we can choose r n so that if \zi\ < r n for 
each i then L{zQ is non-negative for all (, then its L l norm would equal its 
integral, and so we would get from (I4.10p that 

\Tp(z)\ < 
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Letting R n = l/r n gives (I4.9p . As the series (14.51) converges absolutely for all 
r\ G D n , and L(0) = 1, the existence of some r n now follows by continuity. 

Let us turn now to obtaining quantitative estimates. 

Case: n = 2. Adding terms to J that are not analytic will not affect the 
integral (I4.10p . so let us consider 



3? 



I + 771 
1 - ?7i 



• 9? 



1+V2 
1-^2 



di =1,0:0—1 



Then Z/ has integral 1 and (I4.10p is unchanged if L is replaced by V . So we 
wish to find the largest r so that V is positive on rO 2 . 

It can be checked numerically that r = 0.5406 works, so the best R2 is 
smaller than the reciprocal of 0.5406, which is less than 1.85. 

Case: n = 3. (By hand). 

Our strategy will be to simply estimate each non-constant term in L by a 
function of r, add them up, and see how small r must be for all these terms 
to be less than 1. 

First, let us estimate the terms from Jo — 1, i.e. those terms with either 
one or two of the a^s equal to 0. We can write 

00 J , 

AM- 1 = [<!•+ E J^jf^ + - 

a 2 =l,a s =l V Z 61 

where the ... mean two more terms with the indices (1,2,3) permuted. 
Therefore if each \rji\ < r, we have 



2K[J (7?) - 1] < 6r + 6 E 



0^3! r a 2 +q 3 



< 



J [a 2 + a 3 )\ 

a 2 =l,C(3=l v ' 

00 

6r + 6(r 2 /2 + 2r 3 /3) + 6 ^ - 1) 



fc=4 



k(k-l) 



6r + 6(r 2 /2 + 2r 3 /3) + 6[- log(l - r) - r - r 2 /2 - r 3 /3]. 



The contribution to L from J 1; where all the indices are at least 1, is at 
most 

00 



2&[Ji 



< 



< 



2 E 

ai =1,0:2=1,03=1 

'ife - 1 



Oil 



at 



2E 



r 



k=3 



k(k-l) 



+ 2[log(l -r) + r + r 2 /2]. 
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Adding the two terms together, we get 



L(v) > 1 - [2r + r 2 + 2r 3 + 



1 — r 



log(l -r)], 



and this is positive if r < .152. So letting R% be the reciprocal of this root, 
which is less than 6.6, will work. 

Case: n = 3. (Computer-aided) 

As in the case n = 2, we consider the kernel 





"I + 771" 


■ 3fJ 


"1 + ^2" 




"1 + 773 




1 - 77! _ 




.1 - r] 2 _ 




.1 - ?73 



"1 = 1, «2 = 1, 03=0 



Q'! 



an 



(Note that there is a plus in the last factor to keep V real.) Again, a computer 
search can find r so that V is positive on rO 3 , and r = .39 works, so R3 < 2.6. 

□ 

Combining Propositions 14. 2\ 14.41 and I4.5[ we get the main result of this 
section. 

Theorem 4.6. There are positive constants M n and R n such that whenever 
T = (Ti,T 2 , . . . ,T n ) G B(H) n satisfies fl^.l| ) and p(z) is a polynomial in n 
variables, then 



lbsym(T)|| < 
lbsym(T)|| < 

Moreover, one can choose R2 = 1.85, R3 



\P\\r 
M, 



2.6, Mo = 4.1 and M, 



(4.11) 
(4.12) 

16.6. 



Remark 4.7. Another way to estimate ||p sym (T)||, under the assumption (14.21) . 
would be to crash through with absolute values. Let A n = {z E C n : 
YTj=i \ z j\ — 1} an d let r n denote the Bohr radius of A n , i.e. the largest r such 
that whenever p(z) = ^2 c aZ a has modulus one on A n , then q(z) = \ c a\z a 
has modulus bounded by one on r A n . One then has the estimate that, under 
the hypothesis f )4.2p . and writing C n = l/r n , 

lbs ym (T)|| < ||g|| An < ||p||c ft A B . (4.13) 
It was shown by L. Aizenberg [H Thm. 9] that 

1 



3eV3 



< r r . 



1 

< -. 

~ 3 



So the estimate in ( 14. lip for pairs satisfying (14. 2 \ does not follow from (I4.13p . 
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5 n-tuples of contractions 

In an attempt to use the above technique for tuples T e B(H) n such that 
maxi<j<„ \\Tj\\ < 1, we consider restricting ( to belong to A n , and we replace 
cr by some probability measure fi supported on A n . 
Suppose we can find some function q such that 

A,(q)(z) := / g (C)»l±^£ dM (0 (5.1) 

equals We do not actually need g to be a polynomial; having an abso- 
lutely convergent power series on A n (in ( and () is enough. 

Lemma 5.1. With notation as above, assume A M (g) = p and thatT £ B(H) n 
is an n-tuple of contractions. Then 

(/llsupptfyz) 

< sup{|g(z)| : z G A n }. 

Proof. We assume first that maxi<j< n ||Tj|| < 1 and use the notation K((, T) 
from the proof of Proposition 14.21 (which is permissible as \\( ■ T\\ < 1 for 
C e A n ). We have 

(A,g) sym (T)= / q(()K{,T) da(() 

and hence the inequality ||(p) syin CO|| < ||g|| S uppt(/x) follows as in the previous 
proof. 

If maxi< 3 -< n \\Tj\\ = 1, we deduce the result from || (p) sym (rT) || < ||o||a„ 
for < r < 1. □ 

Remark 5.2. For an arbitrary measure fi, there might be no q such that 
A M (g) = p. If fi is chosen to be circularly symmetric, though, one gets 

As long as none of the moments on the right of (15.21) vanish, inverting A^ is 
now straightforward. 

To make use of the lemma to bound p sym (T) we need to find a way to 
choose another polynomial q and a fi on A n so that p = A M g and ||g||A n is 
small. We do not know a good way to do this. 

Question 1. What is the smallest constant R n such that, for every n-tuple T 
of contractions and every polynomial p, one has 

lb sym (T)|| < |b|L 5»? (5.3) 



\a\ 



«i! . . . a n \ 



(5.2) 
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We do not know if one can choose R n smaller than the reciprocal of the 
Bohr radius of the polydisk, even when n — 2. 

Question 2. Is there a constant M n such that, for every n-tuple T of con- 
tractions and every polynomial p, one has 

IbsymCnil <M n \\ P \\ W <? (5.4) 
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